35 research outputs found

    Chromosomer: A Reference-Based Genome Arrangement Tool for Producing Draft Chromosome Sequences

    Get PDF
    Background: As the number of sequenced genomes rapidly increases, chromosome assembly is becoming an even more crucial step of any genome study. Since de novo chromosome assemblies are confounded by repeat-mediated artifacts, reference-assisted assemblies that use comparative inference have become widely used, prompting the development of several reference-assisted assembly programs for prokaryotic and eukaryotic genomes. Findings: We developed Chromosomer – a reference-based genome arrangement tool, which rapidly builds chromosomes from genome contigs or scaffolds using their alignments to a reference genome of a closely related species. Chromosomer does not require mate-pair libraries and it offers a number of auxiliary tools that implement common operations accompanying the genome assembly process. Conclusions: Despite implementing a straightforward alignment-based approach, Chromosomer is a useful tool for genomic analysis of species without chromosome maps. Putative chromosome assemblies by Chromosomer can be used in comparative genomic analysis, genomic variation assessment, potential linkage group inference and other kinds of analysis involving contig or scaffold mapping to a high-quality assembly

    A Moving Landscape for Comparative Genomics in Mammals

    Get PDF
    Today we count some 62,000 species of vertebrates (half are fishes) including some 550 species of mammals on earth. The genome sequencing of non-laboratory species in recent years is expanding our breadth and understanding of genetic bases of adaptation and evolution in varied and amazing ways. Recent completion and inspection of whole genome sequence and assembly for over 200 species of mammals, from platypus to panda to human, offer the prospect of a better view of the patterns of changes within genome organization across the mammalian radiations. In 2009 my colleagues and I have created Genome-10K, an international consortium of scientist who have set a goal of gathering, sequencing, assembling, and annotating to high quality some 10,000 vertebrate genomes with 2nd and 3rd generation sequencing technology within the coming five years. These activities and advances provide an enormous Bioinformatics challenge whose solution will provide future zoologists of every persuasion a genome sequence resource for their favorite study animal. The applications and potential for the genome sequence in several research questions will be discussed

    Draft de novo Genome Assembly of the Elusive Jaguarundi, Puma yagouaroundi

    Get PDF
    The Puma lineage within the family Felidae consists of 3 species that last shared a common ancestor around 4.9 million years ago. Whole-genome sequences of 2 species from the lineage were previously reported: the cheetah (Acinonyx jubatus) and the mountain lion (Puma concolor). The present report describes a whole-genome assembly of the remaining species, the jaguarundi (Puma yagouaroundi). We sequenced the genome of a male jaguarundi with 10X Genomics linked reads and assembled the whole-genome sequence. The assembled genome contains a series of scaffolds that reach the length of chromosome arms and is similar in scaffold contiguity to the genome assemblies of cheetah and puma, with a contig N50 = 100.2 kbp and a scaffold N50 = 49.27 Mbp. We assessed the assembled sequence of the jaguarundi genome using BUSCO, aligned reads of the sequenced individual and another published female jaguarundi to the assembled genome, annotated protein-coding genes, repeats, genomic variants and their effects with respect to the protein-coding genes, and analyzed differences of the 2 jaguarundis from the reference mitochondrial genome. The jaguarundi genome assembly and its annotation were compared in quality, variants, and features to the previously reported genome assemblies of puma and cheetah. Computational analyzes used in the study were implemented in transparent and reproducible way to allow their further reuse and modification. </p

    Analytical “Bake-Off” of Whole Genome Sequencing Quality for the Genome Russia Project Using a Small Cohort for Autoimmune Hepatitis

    Get PDF
    A comparative analysis of whole genome sequencing (WGS) and genotype calling was initiated for ten human genome samples sequenced by St. Petersburg State University Peterhof Sequencing Center and by three commercial sequencing centers outside of Russia. The sequence quality, efficiency of DNA variant and genotype calling were compared with each other and with DNA microarrays for each of ten study subjects. We assessed calling of SNPs, indels, copy number variation, and the speed of WGS throughput promised. Twenty separate QC analyses showed high similarities among the sequence quality and called genotypes. The ten genomes tested by the centers included eight American patients afflicted with autoimmune hepatitis (AIH), plus one case’s unaffected parents, in a prelude to discovering genetic influences in this rare disease of unknown etiology. The detailed internal replication and parallel analyses allowed the observation of two of eight AIH cases carrying a rare allele genotype for a previously described AIH-associated gene (FTCD), plus multiple occurrences of known HLA-DRB1 alleles associated with AIH (HLA-DRB1-03:01:01, 13:01:01 and 7:01:01). We also list putative SNVs in other genes as suggestive in AIH influence

    Genome-wide sequence analyses of ethnic populations across Russia

    Get PDF
    The Russian Federation is the largest and one of the most ethnically diverse countries in the world, however no centralized reference database of genetic variation exists to date. Such data are crucial for medical genetics and essential for studying population history. The Genome Russia Project aims at filling this gap by performing whole genome sequencing and analysis of peoples of the Russian Federation. Here we report the characterization of genome-wide variation of 264 healthy adults, including 60 newly sequenced samples. People of Russia carry known and novel genetic variants of adaptive, clinical and functional consequence that in many cases show allele frequency divergence from neighboring populations. Population genetics analyses revealed six phylogeographic partitions among indigenous ethnicities corresponding to their geographic locales. This study presents a characterization of population-specific genomic variation in Russia with results important for medical genetics and for understanding the dynamic population history of the world's largest country

    Genomic Legacy of the African Cheetah, Acinonyx jubatus

    Get PDF
    Background Patterns of genetic and genomic variance are informative in inferring population history for human, model species and endangered populations. Results Here the genome sequence of wild-born African cheetahs reveals extreme genomic depletion in SNV incidence, SNV density, SNVs of coding genes, MHC class I and II genes, and mitochondrial DNA SNVs. Cheetah genomes are on average 95 % homozygous compared to the genomes of the outbred domestic cat (24.08 % homozygous), Virunga Mountain Gorilla (78.12 %), inbred Abyssinian cat (62.63 %), Tasmanian devil, domestic dog and other mammalian species. Demographic estimators impute two ancestral population bottlenecks: one \u3e100,000 years ago coincident with cheetah migrations out of the Americas and into Eurasia and Africa, and a second 11,084–12,589 years ago in Africa coincident with late Pleistocene large mammal extinctions. MHC class I gene loss and dramatic reduction in functional diversity of MHC genes would explain why cheetahs ablate skin graft rejection among unrelated individuals. Significant excess of non-synonymous mutations in AKAP4 (p\u3c0.02), a gene mediating spermatozoon development, indicates cheetah fixation of five function-damaging amino acid variants distinct from AKAP4 homologues of other Felidae or mammals; AKAP4 dysfunction may cause the cheetah’s extremely high (\u3e80 %) pleiomorphic sperm. Conclusions The study provides an unprecedented genomic perspective for the rare cheetah, with potential relevance to the species’ natural history, physiological adaptations and unique reproductive disposition

    Pangolin genomes and the evolution of mammalian scales and immunity

    Get PDF
    Pangolins, unique mammals with scales over most of their body, no teeth, poor vision, and an acute olfactory system, comprise the only placental order (Pholidota) without a whole-genome map. To investigate pangolin biology and evolution, we developed genome assemblies of the Malayan (Manis javanica) and Chinese (M. pentadactyla) pangolins. Strikingly, we found that interferon epsilon (IFNE), exclusively expressed in epithelial cells and important in skin and mucosal immunity, is pseudogenized in all African and Asian pangolin species that we examined, perhaps impacting resistance to infection. We propose that scale development was an innovation that provided protection against injuries or stress and reduced pangolin vulnerability to infection. Further evidence of specialized adaptations was evident from positively selected genes involving immunity-related pathways, inflammation, energy storage and metabolism, muscular and nervous systems, and scale/hair development. Olfactory receptor gene families are significantly expanded in pangolins, reflecting their well-developed olfaction system. This study provides insights into mammalian adaptation and functional diversification, new research tools and questions, and perhaps a new natural IFNE-deficient animal model for studying mammalian immunity.University of Malaya and Ministry of Education, Malaysia [UM.C/HIR/MOHE/08]; UMRG grant from the University of Malaya and Ministry of Education, Malaysia [RG541-13HTM]; Russian Ministry of Science [11.G34.31.0068]; NIH-NHGRI grant [5U54HG00307907]SCI(E)[email protected]

    HalSynteny: A fast, easy-to-use conserved synteny block construction method for multiple whole-genome alignments

    No full text
    Background: Large-scale sequencing projects provide high-quality full-genome data that can be used for reconstruction of chromosomal exchanges and rearrangements that disrupt conserved syntenic blocks. The highest resolution of cross-species homology can be obtained on the basis of whole-genome, reference-free alignments. Very large multiple alignments of full-genome sequence stored in a binary format demand an accurate and efficient computational approach for synteny block production. Findings: halSynteny performs efficient processing of pairwise alignment blocks for any pair of genomes in the alignment. The tool is part of the HAL comparative genomics suite and is targeted to build synteny blocks for multi-hundred-way, reference-free vertebrate alignments built with the Cactus system. Conclusions: halSynteny enables an accurate and rapid identification of synteny in multiple full-genome alignments. The method is implemented in C++11 as a component of the halTools software and released under MIT license. The package is available at https://github.com/ComparativeGenomicsToolkit/hal/

    Whole-Genome Analysis of Mycobacterium tuberculosis from Patients with Tuberculous Spondylitis, Russia

    No full text
    Whole-genome analysis of Mycobacterium tuberculosis isolates collected in Russia (N = 71) from patients with tuberculous spondylitis supports a detailed characterization of pathogen strain distributions and drug resistance phenotype, plus distinguished occurrence and association of known resistance mutations. We identify known and novel genome determinants related to bacterial virulence, pathogenicity, and drug resistance

    Genomic Variations in Drug Resistant Mycobacterium tuberculosis Strains Collected from Patients with Different Localization of Infection

    No full text
    Mycobacterium tuberculosis is a highly studied pathogen due to public health importance. Despite this, problems like early drug resistance, diagnostics and treatment success prediction are still not fully resolved. Here, we analyze the incidence of point mutations widely used for drug resistance detection in laboratory practice and conduct comparative analysis of whole-genome sequence (WGS) for clinical M. tuberculosis strains collected from patients with pulmonary tuberculosis (PTB) and extra-pulmonary tuberculosis (XPTB) localization. A total of 72 pulmonary and 73 extrapulmonary microbiologically characterized M. tuberculosis isolates were collected from patients from 2007 to 2014 in Russia. Genomic DNA was used for WGS and obtained data allowed identifying major mutations known to be associated with drug resistance to first-line and second-line antituberculous drugs. In some cases previously described mutations were not identified. Using genome-based phylogenetic analysis we identified M. tuberculosis substrains associated with distinctions in the occurrence in PTB vs. XPTB cases. Phylogenetic analyses did reveal M. tuberculosis genetic substrains associated with TB localization. XPTB was associated with Beijing sublineages Central Asia (Beijing CAO), Central Asia Clade A (Beijing A) and 4.8 groups, while PTB localization was associated with group LAM (4.3). Further, the XPTB strain in some cases showed elevated drug resistance patterns relative to PTB isolates. HIV was significantly associated with the development of XPTB in the Beijing B0/W148 group and among unclustered Beijing isolates
    corecore